| Name-Last Name:              |                     | St |       |       |                                          |              |                     |  |
|------------------------------|---------------------|----|-------|-------|------------------------------------------|--------------|---------------------|--|
|                              | Section (Check one) |    |       |       | Section 1 (Wednesday) Section 2 (Friday) |              |                     |  |
|                              |                     |    |       |       |                                          |              |                     |  |
| Hacettepe University         |                     |    |       |       | Co                                       | mputer Eng   | ineering Department |  |
| BBM234 Computer Organization |                     |    | Insti | ructo | r: Asso                                  | oc. Prof. Di | :. Suleyman TOSUN   |  |
| Midterm Exam                 |                     |    |       |       |                                          | Exa          | am Date: 15.04.2016 |  |
| Duration: 100 minutes        |                     |    |       |       |                                          |              |                     |  |
|                              |                     |    |       |       |                                          |              |                     |  |
| Ouestions                    | 1                   | 2  | 3     |       | 4                                        | 5            | Total               |  |

| Questions | 1  | 2  | 3  | 4  | 5  | Total |
|-----------|----|----|----|----|----|-------|
| Marks     | 20 | 20 | 20 | 20 | 20 | 100   |
| Earned    |    |    |    |    |    |       |

Suppose we have an array A with 100 elements. The memory address of the first array element is 0xA1B2C3D4. Write a MIPS function named ordered, which checks if the elements of the array A are in ascending order. If they are, the function should return 1. Otherwise, it returns 0. Call the function ordered from the main function. And then, store the return value in register s0.

Note that you do not have to store the variables in the stack when function is called.

**Important note:** You are allowed to use only the instruction we have seen in our lectures. You cannot use any other pseudo instructions. If you use any, you will lose 5 points for each pseudo instruction.

- **Q2.** You are given a MIPS program below.
  - a) Fill the given table for this program by entering the following:
    - How many times does each instruction execute (fetched) in the program?
    - What is the clock cycles of each instruction when executes on multi-cycle processor? Diagram of the multi-cycle processor is given in Question 5 if you need it
    - What are the total clock cycles for each instruction and the whole program when executed on multi-cycle processor?

|       | Instruction          | How many times does an instruction execute? | Clock cycles of the instruction for multi-cycle processor | Total clock cycles for multi-cycle processors |
|-------|----------------------|---------------------------------------------|-----------------------------------------------------------|-----------------------------------------------|
|       | addi \$s0, \$0, 2    |                                             |                                                           |                                               |
| loop: | beq \$s0, \$0, done  |                                             |                                                           |                                               |
|       | srl \$s0, \$s0, 1    |                                             |                                                           |                                               |
|       | bne \$s0, \$0, else  |                                             |                                                           |                                               |
|       | lw \$s5, 0(\$a0)     |                                             |                                                           |                                               |
|       | add \$s5, \$s5, \$s0 |                                             |                                                           |                                               |
|       | sw \$s5, 0(\$a0)     |                                             |                                                           |                                               |
| else: | addi \$s0, \$s0,-1   |                                             |                                                           |                                               |
|       | j loop               |                                             |                                                           |                                               |
| done: | jr \$ra              |                                             |                                                           |                                               |
| To    | tal (Single-Cycle):  |                                             | Total (Multi-Cycle):                                      |                                               |

b) Show the formats of the instructions beq (opcode=4) and srl (funct=2). Then, write machine codes for these instructions in hexadecimal format. (s0=16)

| Instruction         | Hexadecimal Machine Code |
|---------------------|--------------------------|
| beq \$s0, \$0, done |                          |
| srl \$s0, \$s0, 1   |                          |

- **Q3.** We would like to add **blez** instruction to single cycle MIPS processor. blez instruction copies the branch target address (BTA, which is *PCBranch* in Figure 2) to the program counter (PC) if the value in register [rs] is less than or equal to zero. In other words, if [rs]  $\leq 0$ , PC = BTA.
  - a) First, add an output signal to the ALU (given in Figure 1) and name this signal as LTEZ (Less Than Equal to Zero). LTEZ=1 when the Result is less than or equal to zero (i.e., Result ≤ 0). Otherwise, LTEZ=0. [5]
  - b) By using LTEZ output signal, show the necessary changes on data-path of single-cycle processors given in Figure 2 and explain your changes. [10]
  - c) Fill the control signals in Table I. [5]



Figure 1: ALU

Figure 2: Single Cycle MIPS Processor

| c) Tabel I: Control signals for <b>blez</b> instruct | 10 |
|------------------------------------------------------|----|
|------------------------------------------------------|----|

| Inst. | Op <sub>31:26</sub> | RegWrite | RegDst | AluSrc | Branch | MemWrite | MemtoReg | ALUOp <sub>1:0</sub> |
|-------|---------------------|----------|--------|--------|--------|----------|----------|----------------------|
| blez  | 000110              |          |        |        |        |          |          |                      |

**Q4.** a) In a MIPS architecture, indicate if the following instructions use sign extend logic or ALU. If an instruction uses ALU, what is the type of operation (add, subtract, and, or, ...etc)? [10]

| Instruction            | Uses sign extend (YES or NO) | Uses ALU<br>(YES or NO) | Type of ALU operation |
|------------------------|------------------------------|-------------------------|-----------------------|
| lw \$s0, 0(\$a0)       |                              |                         |                       |
| sllv \$s0, \$s1, \$s2  |                              |                         |                       |
| sw \$s0, 0(\$a0)       |                              |                         |                       |
| jal target             |                              |                         |                       |
| bne \$s0, \$s1, target |                              |                         |                       |

b) Suppose following delays have been determined for the elements of the single-cycle processor. You can ignore the delays of other elements in the design.

| Element             | Parameter      | Delay (ps) |
|---------------------|----------------|------------|
| Register clock-to-Q | $t_{pcq}$ PC   | 30         |
| Register setup      | $t_{ m setup}$ | 20         |
| Multiplexer         | $t_{ m mux}$   | 25         |
| ALU                 | $t_{ m ALU}$   | 200        |
| Memory read         | $t_{ m mem}$   | 250        |
| Register file read  | $t_{RF}$ read  | 150        |
| Register file setup | $t_{RF}$ setup | 20         |

b1) What is the worst case delay for R-type instructions? Show how you determine this value. [5]

b2) What is the worst case delay for beq instruction? Show how you determine this value. [5]

- **Q5.** In the multi-cycle processor given in Figure 3, lw instruction takes 5 clock cycles.
  - 1. Fetch: Fetch the instruction from instruction memory and store it into the instruction register.
  - 2. Decode: Read the operands such as registers and immediate values.
  - 3. Execute: Add operands (add rs and sign extended immediate value.)
  - 4. Read memory: Read the data from memory and store it into data register.
  - 5. Write result: Write the data from data register to destination register in register file.

Your goal is to *design a microarchitecture for lw instruction that takes only 3 clock cycles*. You should merge cycles Fetch and Decode into FetchDec and merge Execute and Read memory into (ExecMem). Write result cycle will be the same.

- a) Draw the data path using the 32-bit Program Counter (PC), instruction/data memory, register file, and additional hardware if necessary. You should clearly show the extra hardware and the bit numbers of each connection. Note that PC must be incremented in the first cycle. [10]
- b) Draw the FSM (Finite State Machine) for new lw instruction. [10]





Figure 3:Multi cycle processor